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Abstract 

Adaptive MCMC are a new class of MCMC procedure that have been 
recently proposed in literature that accentuate the convergence of the chain 
to the target distribution. However, since the transition kernel is not same 
at each iteration, the convergence is more difficult to show, 0. In an ear- 
lier paper by Basak et al, [1], applying diffusion approximation, the authors 
arrived at a diffusion governing the dynamics of a suitably defined AMCMC 
for an arbitrary target density The resulting diffusion was more easy 

to handle compared to its discrete counterpart. In this paper we study the 
diffusion when the target distribution is standard Normal. Although this is a 
degenerate one, it satisfies Hormander's hypoellipticity condition and hence it 
has positive density on its support. Next, under some assumptions, we show 
it has a unique invariant distribution whose marginal distribution is Normal. 
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Hypoelliptic conditions, Ito's Lemma, MCMC. 
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1 Introduction 

Markov chain monte Carlo (MCMC) methods are a class of algorithm used to sim- 
ulate a sample from an arbitrary distribution known only upto a constant. One of 
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the algorithms belonging to this class is the Random Walk Metropolis-Hastings (RW 
MH) sampler. The method involves choosing a Markov chain such that the (unique) 
invariant distribution is the target density of interest. This is done by choosing a 
proposal density, from which simulating a sample is possible, and then accepting the 
generated sample with a certain probability (called the MH acceptance probability). 
For more information see [Sj. 

One disadvantage of this method is that the speed of convergence depends on the 
proposal density. Adaptive MCMC (AMCMC) have been devised to counter this 
problem. Here the parameter (s), which will typically be scaling constants in the 
proposal density, are a function of the previous samples that have been generated so 
far. Hence, the proposal density changes at each iteration. This should be done in 
such a way that the scaling constants involved in the proposal density are the best 
possible choices in some sense. 

Although AMCMC is better suited for simulation purposes it is important that 
the limiting distribution is indeed the target distribution (also called the ergodic 
property in the MCMC literature , see for example [8]). Since the transition ker- 
nel changes at each iteration, verifying ergodicity sometimes becomes complicated. 
Rosenthal et al ([9]) proved some sufficient condition for ergodicity, viz., diminishing 
adaptation condition, where the difference between the total variation norm of the 
kernels at the i*'' and the {i + iy^ iteration should go to zero, as i goes to infinity, and 
the simultaneous uniform ergodicity condition where time to convergence starting 
from any point is uniformly bounded over the starting points. Very often, verifying 
this condition for a markov chain (even with simple target density, say Normal) can 
be quite involved. 

This paper approaches the problem from a different standpoint. By applying the 
diffusion approximation scheme, together with an auxiliary variable, to the discrete 
chain we convert it to a continuous time two dimenisonal diffusion process. Our gain 
by such an enterprise is that we can then invoke results in literature for Markov pro- 
cesses to infer about its invariant distribution whose marginal can then be possibly 
identified with the target distributionof the MCMC. Sometimes this can be done 
easily when compared to the discrete time setting. 

In a different context, Gelman et al, (see |1]), applied this technique to a M H al- 
gorithm, whose target distribution was multivaiate normal with iid normal compo- 
nents.. They used the resulting stochastic differential equation (SDE) to recommend 
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choices of the tuning parameter that result in fastest convergence of the diffusion to 
the target distribution. Later the results were exteded by Bedard et al [2] . However 
all the above papers deal with standard MCMC, where the transition kernel was 
fixed at each iteration. To the best of our knowledge, this (along with its com- 
panion paper [1]) is the first instance where diffusion approximation is applied to 
AMCMC. 

This paper is arranged as follows. Section [2] contains the definition of the AMCMC 
and briefly mentions the diffusion approximation procedure done in [1]. Section [3] 
contains the main result (Theorem [2]) of this paper, i.e., existence of the invariant 
distribution of the process along with the identification of the target distribution. 
The various subsections of Section [3] contributes to the proof of Theorem O In 13.11 
we show that the process is tight. This combined with the hypoelliptic condition in 
13.21 shows that the process admits a smooth invariant distributon. After establishing 
moment conditions of the variables under consideration in Section 13.31 and Section 
13. 4^ identification of the target distribution is proved in Section 13.51 We end with 
some concluding remarks in |H 

2 Definitions 

We define the AMCMC in such that the scaling parameter in the Normal proposal 
density is a function of whether the previous sample was accepted or not (ideally it 
should not depend only on the previous sample but on the whole sequence of sample 
that has been generated, but computations become more extensive in that case). 
Here we formaly define our algorithm: 

1. Select arbitrary {Xq, 9q} G 7^ x [0, oo) where 71 is the state space. Set n = 1. 

2. Propose a new move say Y where 

3. Accept the new point with probability a(X„_i, Y) = min{l, ^p^'^''^-) } 
If the point is accepted set Xn = Y, = l; else Xi = C,i = 

4. On = ^„_ie^^^""^^ p>0 

5. n n + 1 
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6. Goto step 2. 

To apply the diffusion approximation to the AMCMC we define the continuous time 
process Xn(t) for all n > 1 and for all t > for any target distribution 

X„(0) = xoeR; 

Xn('^) = X.(-) + ^^n(-)U^K(^), i=0,l,..., 

\ n n \ n n n n 



Xn{t) = ^n(^), if ^ < t < ^ for some integer i. (2.1) 
Here, ^n(— ) conditionally follows the Bernoulli distribution given by: 

P(^n{ ) = l\Xn{-),9n{-),en{ ) = mm{ / ,1}, 



n n n n J 

and {e„(^)} are all independent A^(0, 1) random variables. The process 6'„(t) is 
defined as : 

^1 = ^„ f- V^^^"^^^-^"^"", i=0,l,..., 

n J \n J 

and 6'„(t) = 6'„(-), if ^ < t < ^ for some integer i. (2.2) 

Where Pn(^) ~ 1 ~ some p > 0. 

It has been observed in an earlier paper (see [1]) that the diffusion governing the 
dynamics of the limiting process is the following 

Theorem 1. (from fJ^) The limit of the process Y„(t) := ^X„(t) 9n(t)j , where 
Xn{t) and 6n{t) is given hy \2.1\ and \2.2\ respectively, is governed by:: 

dYt = b{Yt)dt + a{Yt)dWt, with Y^ = {X,,et)\ (2.3) 



where, 



e^^p'ix,) ^ f e, w{x,)\ 



2 ^{Xt) 'A 



and Wj is a two dimensional Wiener process. 
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3 Main result 

In this paper we concentrate on the case where the target density is standard Normal 
(ie, ip{x) = '^^~~)- Then the diffusion looks like: 

dYt = b{Yt)dt + a{Yt)dWt, where, 
KYt) = J^l^^l))- (3-1) 

and cr(Yj) remains the same. 

Theorem 2. The X -marginal of the invariant distribution of (E2P is N{0, 1). 

Proof: The proof of the above theorem is spread over various subsections. In Section 
13.11 we show that the process [Xt^rjt) where 77* = 1/6'^ is tight. This combined with 
the hypoelliptic condition in Section 13.21 shows that the process admits a invariant 
distributon. The marginal of the invariant distribution is identified as the target 
distribution in Section [3751 

3.1 Tightness of {Xt,T]t) 
We state a simple Lemma. 
Lemma 1. . 

Jp* OudWu is a martingale with respect to Tt = cr^Xg, Os]0 < s < t) and hence 

E{[ OtdWt) = 0. 
Jo 

Proof: It is sufficient to show that the martingale Yt := J^OsdWs is L2-bounded. 
So using Iso's isommetry it is sufficent to show that 

E(^j^ ejds^ < 00 

But, 

de^ = 29tdet 

= 2e^{p-^\Xt\et)dt 

< 2etpdt 
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Now writing Yt = 9^ we have that 

dYt < 2pYtdt 

Multiplying both sides by the integrating factor e~^^^* we get 

d{Yte-^P^') < 
Integrating and taking expectations we get 

{Yte-^^') - (Fo) < el< ele^P' and E{e1) < E{el)e^P\ Wt > 0. 

Also [ elds < dl2p{e^'P' - 1) and hence, E{ [ Olds) < oo, Wt > 0. (3.2) 
Jo Jo 

□ 

Here is the main result of this subsection. 

Lemma 2. Assume E{Xq) and E{6q) are finite. Then, for the coupled system i\3.1\) 

1. supt>o -E'(Xj^) < oo, whenever E{Xq) and E{6q) are finite; 

2. 6t > almost surely; 

3. rjt > almost surely, where rjt = j^; 

4- Joint distribution of {{Xtjrjt) : t > 0} is tight. 
Proof. 

1. 

supE(X^) < oo 
t 

Applying Ito's Rule to Yt = we get 

dYt = 2XtdXt + {l/2)2eldt 

= -XtO^dt + 2XtetdWt + e^dt 
= {-x^e^^ + el)dt + 2XtetdWt 

^dYt + YtOldt = e'^dt + 2XtetdWt. 
Multiplying both sides by e-^o ^ud-^ gg^ 
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Jo Jo 

Jo Jo 



Writing Fg = e-^o ^Idu^ (therefore F,' = O'^Fg) we have ; 
^2 ^ Xl^f^F^ds , J^FMdWs 



Ft Ft Ft 
^0 , Ft -I , J^FMdW, 



Ft Ft Ft 
^ E{Xl) < E{Xl) + 1 + 2E{— / FsXsOsdWs), since > 1 V t > 0. 

(3.3) 

Write Yt = F{s)Xs6sdWs and = = e-^'o^-'^". So the term inside the 
expectation on the right side is YtFt := Zt. Applying Ito's lemma to Zt we 
get, 

dZt = YtdFt + FtdYt + dYtdFt, 

= -YtOfFtdt + FtXtetF{t)dWt + 

= -ZtOldt + XtOtdWt with Zq = 0. (3.4) 

Writing Zt = —Zt we have the SDE of Zt as 

dZt = -Zt9l + XtOtdWt where W^f = -IVi 

Comparing the SDE of Zt and Zt , and noticing that Zq = Zq = 0, we have 
that Zt = Zt in distribution. This implies, Zt is symmetric about 0. Therefore, 

E{Zt) = if we show that E\Zt\ < oo. Note, if E{Zt) = 0, for each t > 0, 
then from (13. 3p . for each t > 0, 

EiX^) < EiX',) + l. (3.5) 

Thus, it suffices to show that Zt has finite expectation for each t > 0. 
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Now 



Zi = ^{f F.e^XJW^i' <( f FA«W^s)' a.s 



ft ,9 , rt 



E{Zf) < E(^j^ FsOsXsdWsY = E(^j^ F^e^X^ds^ from Ito's isommetry 



< E{^j^{F,esYds Xtds 

< [E(^j\FAs)'ds^E(^j^Xtdsy^ 



from Cauchy-Schwarz inequality (3.6) 
Let us define Yt = 9^ for n = 1, 2, ... we have 

dYt = ne'^-^det = ne'^-^(et{p-^-^^)dt) 

< np9^ = npYt 

Multiplying by e""^* we have 

ci(e-"P*Ft) < 
^Yte-^'P'-Yo < 

^Qn^raps ^ Qn^p{ra+n)t for s < t and Tl = 1, 2, . . . , m = 0, 1, . .(3.7) 

Now since 

Xt = Xo- -^ds + / OsdWs 
Jo ^ Jo 

32 rt 



x't < 3(|Xo|^ + (/ ^^d,)2 + (/ e,dw,)') 



2 JO 



\XsW. 

) 2 

^Xf < 27{\Xo\' + i f ^-^dsY + {\ f 9 JW,\r) 

Jo ^ Jo 

Writing Mt := \jQ9sdWs\ which is a (loc) submartingale. Now applying 
Burkholder-Davis-Gundy (BDG) inequality to Mt we have 

E{M^) < E{{ sup Muf) < C4^([M]2) = C^E{ f e^dsf < C^E{el2p{e^P' - l)f 

0<u<t Jo 

= C4el2p{e^P' - 1) (3.8) 
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Again, the equation, d9t = 9t{p — jtit, yields 



= d{e-'P%) + e-P'^^-^^dt 



= e-p%+ l\~p-^\^^ds 



. 27r 

eP%-9t = [\p('-^)\^^ds. 
Jo 



Thus, 



Jo V Stt 







27T 







27r 



ds\) < {eoeP' + et)\ (3.9) 



And, from (^oe^* + < 8(^^e^P* + Of) < 8{eyp' + Oyp') = 169yp\ 

Hence, from (KB and (D 



And 



^(^ X» < 27(E(Xo4)t+((16 + C4/)(e^^*-l)/4p 

-2C4/(e2^* - l)/2p + C,pH)E{e^,)) 
< oo (3.10) 

Jo Jo 

< E{e^ f e''P'e^o<-""''^-ds) 

Jo 

= E{et f e^P^e^^^'^'-'^/^Pds) 
Jo 

< E{9y^oi'^""-^^/P [ e^P'ds) < oo (3.11) 

Jo 

So, from fl3.10p and (13. lip we have: 

E{Z^,) < oo. 

This proves 1. 
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Remark 1. Although we have only proved that the second moment of Xt is 
uniformly bounded, it is shown in Section \3.3\ that all even ordered moments 
(and therefore all moments) are uniformly bounded. 



2. Take Vt = ^. Then 



dVt = -^det -^et{p-^\Xt\et)dt 

tft v27r 



-Vtip--^)dt 



VV2n 

\Xt\ . 
Vtp + ^)dt 



'271 

Multiplying by the integrating factor e^* we get 



d{eP'Vt) = ^^dt 



'2lT 

e'^'Vt-Vo = I'^enXuldu 



/'■ 


1 


Jo 




Voe 


-pt _|_ 



Vt = ^e-^*+ / e-P(*-")^du (3.12) 







. 27r 

t 



^E{Vt) = E{Vo)e-P'+ I e-P^'-'^^^^du 

Jo V 27r 

py2'K 

(since sup^^o -^(l^t I) < Md, for some Mq > 0, depending on ii^(|Xo|)) 
< Ml (depends on E(l/^o), and E(|Xo|)) 

^supE(l/^t) < Ml 

t>o 

Therefore > almost surely. 
3. Define, rjt = l/9t. From the proof of the second part, it follows that 

Vt = Voe~P^ + / e"^^*""^^ ^-^du > 0, whenever ?7o > 0. 
Jo V 27r 

Also, 
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for any t > 0, for each sample points, e''^^*~'^^^-^=du = if only if X„ = 
for almost all u G (0,t] w.r.t. Lebesgue measure. This is impossible, since 
Xu = for almost all u implies dXu = OudWu- Then Xu would be non-zero 
for almost all u, since as 

pu 

On = 77, ^ > (3.13) 

for all u. This contradicts our assumption that X^ = for almost all u. There- 
fore Xu 7^ for n on a set of positive Lebesgue measure. Hence e~^'^'^~'^^^-^=du > 
almost all u. Thus, rjt > a.s. Repeating the same argument will, in fact, 
give Xu 7^ for almost all u G (0, t], for any t > 0. This proves the third part. 

4. This follows from the above where we have shown that supj>Q -EdXtl) < oo 
and supj>o E{l/e t) < oo. Let Ri and i?2 be two positive numbers. Then 

P{\Xt\<Ru\7]t\<R2) = l-P{{\Xt\> Ri)U{\r]t\> R2)) 

> l-iPi\Xt\>R,) + P{\7]t\>R2)) 

> l-E{\Xt\)/Ri-E{\7]t\)/R2 



Hence given any e > we can choose Ri, R2 sufficiently large so that P(|Xf| < 
-Ri, \rit\ < R2) > 1 — e. This proves tightness of {Xt,r]t). 

□ 

3.2 Hypoelliptic condition 

Here we show that the vector fileds corresponding to 13.11 satisfies the Hormanders 
hypoelliptic condition (see the proposition for the statement of the condition). 
Since the condition requires smooth vector fields, we convert the drift and diffusion 
coefficients in 12.31 into smooth vector fields. 
For this purpose define 



where ge{x) — > |x| as e J, in the pointwise is a smooth function, and a{x,ri) = 
1/7] \ 

^ ^ j as the drift and the diffusion coefficient respectively of our new equa- 
tion. Such function can be constructed by convoluting the function |x| with a 
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moUifier. Defining in this fashion \ \b^(-, ■) — ■)|| — )■ as e J, 0. 
Consider a SDE in the Stratonovich form: 



dXt = Ao{Xt)dt + J2MXt)odB^. 



(3.14) 



a 



where / is a smooth function on M and o denotes Stratonovich integral. SDE in 
the Ito and the Startonovich form are interchangeable: for a multidimensional SDE 
given in the Ito's form 



can be readily converted into the Stratonovich form from the following equation: 



where b(t, x) = {bi(t, x))' is the drift term for the Stratonovich form. From the form 
of (j(x,?7) we can say that 6'^ and fe^ are the same. We can identify the drift and 
diffusion coefficients Ao(Xt) and Ai(Xi) can be identified as vector fields in M. 
Here is the condition due to Hormander, ([5]): 

Proposition 1. Let {Aq.Ai, . . . , A„} be n smooth vector fields on R'^. Define the 
Lie Bracket [V, W] (x) between two vector fields V and W as 



where D{V{x)) is the Frechet derivative. The (parabolic) Hormanders hypoelliptic 
condition is satisfied if : 



A,,{y), [A,„(y),A,,(y)] [[A,„(y), A,,(y)], A,,(y)] ■■• [[[[A,,{y), A,,{y)], A,,{y)], A,,{y)], . . . , A,,Xy)] 



spans R'^ for every point y G R'' and any {jo, ji, . . . , j^} G {1, 2, . . . , n} jo < ji, • • • , j; 

Lemma 3. The vector fields A^{y) and Ai{y) satifies Hormanders hypoelliptic con- 
dition [I] 

Proof: Identifying 13.141 with 13.11 we have (writing y = (x, 77)): 



dX, 



h{t,Xt)dXt + a{t,Xt)dWt 




[V, W]{x) = DV{x)W{x) - DW{x)V{x), 



X d 



+ {-pr] + 



9e{x) d 




A(y) 



1 d 



rj dx 
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Therefore vectors corresponding to AQ{y),Ai{y) and [AQ(y), Ai(y)] are 

Since 6 = l/t] > and x ^ almost surely, one obtains that these two vectors span 
TZ^, almost surely, (what about Aq) □ 

It is well known that if the vector fileds Ao(y) and Ai(y) satisfy the above conditions 
then the solution of the SDE (13.141) admits a smooth density (see, [7]). 
Thus, satisfy (parabolic) Hormander's condition and hence even though the original 
diffusion is singular its transition probability has density (see [6]). Again, since the 
coupled diffusion is tight, it admits unique invariant probability by [B]. and it admits 
a density. 

Remark 2. Note that although we are interested in the distribution of X showing 
tightness of the process X only would not suffice since 9t may he a function of 
{Xs'i < s < t}, so marginally X may not he a Markov process. Hence ^xvg E\Xt\ < 
M would give the tightness of X hut it would not he appropriate to tell ahout the 
existence of a unique distribution. 



3.3 Uniform boundedness of moments of Xt 

Lemma 4. For any k > 1, moment of order 2k of Xt is uniformly hounded, ie., 

supE(Xf ) < cx), 
t 

whenever E{Xq^) is finite. 

Proof: Applying Ito's Lemma to Yt = X^^ we get 

(iXf = 2kX^^~^dXt + k{2k-l)Xfe1dt 

= [-kX'^^el + k{2k - l)Xf ^2^,2 j ^-1- 2kX^^^^etdWt 

(^-A;Xf ^2 ^ j^^2k - l)(aXf + h)ei^dt + 2A;Xf "^^tdH/t, 



< 



since for any small a > 0, there exists h large enough such that, x^^ ^ < ax^^ + h Vx. 
Thus, for < a < 1/(2 A; — 1) we have 

rfXf < -Xf ^2 (^j^ _ j^(^2k - l)a) dt + k{2k - l)he1dt + 2kX'^^-^etdWt 

dxf + CkX^^eldt < k{2k - i)heldt + 2kx^,''-^etdWu 
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where Ck = k{l — {2k — l)a) > 0. Multiplying by the integrating factor e*"* -^o ^u*^" := 
F^*" on both sides we get 



< Xf + k{2k - 1)6 [ elF^'^du + 2k f F^'^Xl'^-^OudWu 

Jo Jo 

^Xf < Xl^F^^" + k{2k-l)hFr^'^ ! elF^''du + 2kF^^'' ! F^^Xl^-^e^dW^ 

Jo Jo 



Now, 



/ elF^^du={Ff''-l)/C, 
Jo 

^ E (xf ) < E (f.-^'^X^^^) +e(^—{1- F-^")^ + 2kE (p-^'^ J F^'^Xf-^e^dW^ 

Following the similar notation as before let Ft^k = ^t~'"'°- Also define Zt^k = Et kYt,k 
where Ft,fc = Jq ^u'°-^u'~^^udWu- Applying Ito's lemma to we have 

dZt^k = Yt,kdFt^k + Ft^kdYf^k 



CkYt,ketFt,kdt + Ft,kXt-'etF^'dWt 
-CkZt^k0^dt + X^l-'9tdWt. 



Now, taking Zt,k = -Zt,k, yields 

dZt^k = CkZt,kOldt - X^'f%dWt = -CkZt^keldt + Xl^-HtdWt 

where Wt = —Wt- Since Z^^k = — ^o,/t = = Zo^k- Hence Zt^k and Zt^k has the same 
distribution, i.e., the distribution is symmetric around 0. Therefore to show that 
E[Zt^k) = 0, Vt > one needs to show Zt^k has finite expectation Vt > 0. Thus, 
as before, it is sufficient to show that E{Z^i^) < 00, Vt > 0. Now, 



Zlk = F;I^I^ F^^X^'-%dW,y < F^^X^'^~%dW, 
e(^J^ F^^X^'-%dWsy = Ei^j'^Ff^elXf-^ds 



a.s 



E{Zt\ < 



< Ff^eldsj^ xf-^eldsY <\lE[j^ Ff'eids^E(^j^ xf-^e^ds 

by Cauchy-Schwarz inequality. 
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Again, E{ F fields) = EHF^^"^" - l)/(4Cfe)) < E(e^^^''o-''7(4Cfc)) < oo. 
For Xl'^-^e'^Js we have 



"t -y- n2 pt 



2 jQ 



for some constant > independent of Xt. 
For the second term, from (13. 9p 

/ \X \f)'^ \ 8fc-4 / , N 8fc-4 / \ 8fc-4 

- (/ ^^^) ^ (^) 

(3.15) 

^^^duy'~'ds < {2T:Y~^{ef-' f e^^^-'^^^ds) (3.16) 



pt 



For the third term, write Mj := | 9sdWs\ and apply BDG inequahty: 

8/C-4 / \ 8fc-4 

sup 

^0<s<t 



l4fc-2 



E[Mt) < E sup MJ <Cfeo^ [M] 

/ /■* \ 4fc-2 / /"* \ 4fc-2 

= Cfco^(^y ^^(y ^y'dsj (3.17) 

for some C^o > and for alH > (3.18) 

Combining (I3.16P and (I3.18P we have 

P'ds) 



E(^J^Xf-Xds^ < (clE{X^''-^) + Cl{27r)^''-^E(eo eP'ds 



< oo 



This proves 

EiZl,) < oo 

and hence the lemma. □ 
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3.4 Finiteness of Time average of moments of 9 

In this section C will stand for a generic constant that might take different values 
in different situations. We assume throughout that, for a fixed k > 1, E{Xq^), 
E{9q^) and E{ri^'^) are finite. For non-random initial data this is already assured. 
We proceed sequentially by the following steps: 
Step 1: We have 



Step 2: We show that 



Using 



supE(Xf ) < oo 

t>0 



sup E{ri^) < CO. 
t>o 



□ 



^71 



to get, 



Vt 



2k 



-pt 



* / N IX I \ 2fc 



1 -e-PS2fe 



P 



P^/2tx / Vl-e-P*7o 
where last inequality follows from Jensen's inequality. Thus 

■l-e-P*\2fc/ p 



supE(r^f) < sup22^-^(i?(r^o^'=)e-^'=^*+(^^) (- 



-pt 



< 2'^~^(E{r], 

< 2^^-^[E{'q, 
Hence the result. 



2k 



sup 



p 



VI - e-P* 



I') 



P\/2t\' t>0 ^± — c ' Jo 
1 \ 2A: 



pV2tt^ t>o 



supE(|Xi 



l2k- 



< OO. 



(3.19) 



□ 



Step 3: We now prove 



i 

sup- / E{\Xu\Ou)du < OO. 
t>o t Jo 
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Note 



d{i + et) = det = et{p-\Xt\et/V27r)dt 

= (1 + et){p - \Xt\et/V2^)dt -{p- \Xt\et/V2^)dt 

d{i + et) + {p-^-^^)dt = {i + et){p-\Xt\et/V2^)dt 
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+ \J2tx vStt 

log- / + ^/ < pt + —= \Xu\du 

J- + c^o V 27r Jo V 27r Jo 



I [ iX^lOudu < V2^p+l [ \Xu\du + V2^ 
t Jo * JO 



log(l + ^o) 



Thus, i E{\X^\e^)du < V2^p + i /J + V2^MMi±M) . 

Therefore, using the moment bounds for X, 



sup^ [ E{\X^\eu)du < C 
t>o t Jq 



□ 

Step 4: We now prove by induction, that for any k > 1, 



1 /"* 

sup - / E{ei)du < C. (3.20) 

t>0 t Jq 



Let, as before, rjt = j-^ then drjt = {—prjt + \Xu\/\/27i)dt. 

Applying Ito's lemma to Yt = Xfrjf with k > 1 positive integer, we get 

dY, = 2X,^l^''^dX, + {2-k/2)xU^''^diit + \2rf-''\dXtf 



2Xtm-'''\-^dt + -dWt) + (2 - k/2)Xf7]l-''^\-pr]tdt + ^dt) 
2^r Vt V2n 

2 - k/2 



{-xht'^ - p(2 - km^r'^ + '-^\x,\^vr'' + 



/2tt 

+ 2Xtril-'''^dWt (3.21) 
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Thus, integrating both side from to t and rearranging we get, 

(4 - k)p 



4-fc 



'2^ — 



2 — k/2 /■* /■* 2^ 

' I \X,\\^ dt-2 Xs7]s' dWs 



'2tx 

1 /■* fe 1 4~fc 4-fc 1 /"* fc {A — h\rt 



Jo 

2 



eidt = ^{x'tvt' -x'^rj,- ) + xie}dt+'-^^ 



/ / c^^-T / X,r],' dWs (3.22) 

Jo Jo 



W2tx 



First consider the first, third and fourth term in 13.221 We proceed in steps 
Step 4a: 1 < A; < 2: Since we have proved in the earher steps that the (2k)th 
moment of Xt and rjt is uniformly bounded we, by an application of C-S inequality, 
can say that 

sup^E(X,\'^ + Xo\'^) < C, (3.23) 

sup - [ E{X^7]i^)dt < C, (3.24) 
t>o t Jq 



and 

sup - / \Xs\^r]i^dt < C. (3.25) 
t>o t Jo 



For the fifth term j Xgfjs ^ dWg is a square integrable martingale (by Ito's issom- 
etry) and hence 

E(^X,ri7^yWs = 0. 



□ 
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Step 4b: Consider the second term in 13.221 For any k>l, 

^ f xlelds = \ f\\x,\^^dl){\x,\^^)ds 

Jo Jo 



= (1 1^ E{\X,\97')dt) (1 £ E{\X,\'^^')dt^ 



^ sup] f E{X^el) < fsup^ ri?(|X,|^i^)rft)'''fsup^ rE(|X,|'=+2)rfty+^ 
t>o t Jo ^t>o t Jq ^ ^i>o r Jo ^ 

< C, (3.26) 

We only have to show that the first term on the RHS is finite. We proceed by 
induction : We have proved (Step 3 in this section ) that 

sup^ [ E(\X,\es)dt<C 
t>o t Jo ^ ' 

Therefore from 13.231 13.241 and 13.251 since /c = 1, we have that 

sup- I E(ei)du < C. 
t>o t Jo ^ ^ 

Assume that the hypothesis is true for /c = m — 1, ie., 

supi / E(e7^)ds < C (3.27) 



i>0 t Jq 



sup^ / e(\XsW) < C (3.28) 

t>0 t Jq \ J 

This also imply, 

sup - / &^ds < C. (3.29) 

t>0 t Jq 
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Then, for k = m. 



m — 1 rri — 1 771— 1 -1 rri — 1 m — 1 



2 



vim — 1) "'-i "^+1 ; — 

'-6^ dt - \Xt\et ' /V2^dt 



1 /"* / m + 1 \ \ m-1 1 /"* m+1 

^sup- E[\X,\9s'' ) < V2^sup-E{9^^ ) + sup - / ^ )rfs 

t>0 t Jq \ / t>0 i i>0 t Jo 

< ^<'" sup i /' B + V2iE {om^) 



t>0 I Jo 

By the induction hypothesis 13.271 and 13.281 This proves the proposition. 
Therefore the second term on 13.221 is finite, for any k > 1. □ 
Step 4c: A; > 3. 



For 3 < A; < 4 we can claim similarly as above that: 

sup^i?(X,\'^+Xo\'^) < C 
-t 



snp- [ E{XlT]y)dt < C. 



For 4 < /c: 



sup^e(x,V') = sup^ijfx^^,' 



sup-^fc) supi^f^,^ 
t>o t \ / t>0 t \ 



and 

1 /•* 



sup-/" eIx^t]^) < C by Step 4b. 

i>0 t Jq \ / 



For the fourth term we apply the Holder's inequality with p = k — 1 and q = 
{k-l)/{k-2) to get 

k — 1 k — 1 

^ snpl f E(\X,\'ei^)ds < ^snpl f E(\Xf ('-'Ads + -\snpl f e( 
t>o t Jq \ J k - 2 t>Q t Jq \ J k - 1 t>o t Jq \ 
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Finiteness of the first term lias been proved in Section [3l3l The second term is finite 
by virtue of Step 4b. 

The last step is to show that the fifth term in 13.221 is 0. Now, 



= ^{j^ ^s^s'^ds^ <Chj Step 4b 
Therefore £XsVs' dWs is a square integrable martingale and hence 

e( [ XsV^dWs) = 



for any t > 0. □ 
Steps 4a, 4b and 4c together proves Step 4. □ 

3.5 Identifying the limiting distribution 

In this section we obtain the limiting moment of E{X^^) and show that is equal to 
for any k E Af. Hence by the uniqueness of the moment generating functions 
we can say that the limiting distribution of Xt is A^(0, 1). We proceed by induction 

1. \imt_^EiX^) = 1. 

Applying Ito's lemma to X^ we have 



dx^ = (^-x^e^ + e^yt + 2XtetdWt. 



Writing Fk(t) = k O^ds and multiplying by the IF = e^^^*-' on both sides we 
have 



^(x^e^iW) = e^'^'^e^dt + e^^^'^XAdWt 

^Xl = e-^^W[X2+ re^iW^2^s+ f e^^^'^X,9JWs] 

Jo Jo 

Jo Jo 

= e-^iW[x2 + e^^W-l+ f e^'^'^XsOsdWs] 

Jo 

= XqV^^W + 1 - e-^^W + re^^(^)-^^Wx,MVr, 

Jo 

^E{Xl) = E{e~^'^'^)E{X^) + l-E(^e^^'^'^ J^e^'^-'^X.ejWs^ 
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We have proved in Lemma [2] that the third expectation is sero. Therefore 

E{X^) = E{e-^'^'^)E{Xl) + 1 - hm E(e-^^W) 
hm E{X^) = E{Xl) hm E{e-^'^'^) + 1 - hm E{e-^'^'^) (3.31) 



Therefore, 



where C = supj>Q-E'(j jl^lds) < oo from Step 2 of Section [231 So 
hm E(e~^iW) = ^ hm ^(X^) = 1 fromESD 



2. Assume this to be true for fc — 1, ie., hm^^oo E{Xf"^) = for 1 < m < k — 1. 

3. hm^^E(Xf) = g. 
We have that 

dXf = {--kXl^el + fc(2A: - l)Xf -2^2 j rft + kXl^-^OtdWt. 
Muhiplying with the IF = e^'^^*-' we have that 

^Xf = e-^^^\xf + {2k-l) f ke^^'^^'^Xf-^elds + k f e^^^^'^Xf-^ejWs] 

Jo Jo 

^ ) = E(e-^^W)E(X2'=) + (2A; - 1)E{ I ke-^^^'^e^^^'^Xf-^Olds) 

Jo 
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We have proved in Lemma H] that the third expectation is zero. Therefore 
hm ^(Xf ) = E{X^'') hm ^(6"^'=^) 

t—>oo t—>ca 

+ {2k - 1) hm E{ [ A;e-^*We^*(^)xf 

Arguing in the same way as the case for Fi{t): 

Fm 1 1 1 



1 k r 

< -- / 7]^,ds hm E(e-^^(*)) = 0. 

t t In t^oo 



Define, for < m < — 1, 



The term inside the expectation is 

Jo Jo 

= X,'-e^^(*) - X^"" - e^^(^) {{-mXl'^el + m(2m - l)Xl'^-^el)ds 

+ ! 2mX^'^-^dsdWs 







io Jo 



— e 



ft 

-Fk{t) 



(3.32) 



And so, 



m(2m — 1) . , , 

- ^^fc,2„^-2 + 0. 3.33 
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Defining B^^rn = limt-!>oo ^fc,m(^) (the limit exists since the integrand is a non negative 
quantity). Applying limits on both sides of 13.331 we get 

R _ pr^2mN n-j-"'R m(2m - 1) 

-Dfc,2m — lim i^l^Aj j— U+— i3fc2m. ; -Dfc,2m-2 

=^1-1- r)^k,2m — j ; -Dfc,2m-2 

- ^ linn F(Y^-^^ m(2m - 1) 
k — m t^oo k — m 

Substituing different values of m = 0, 1, 2, . . . , /c — 1 we get 

Bkfl = 1 

k 1 

Bk2 = 1-1 1 = 1 

'''^ k-1 k-1 

k 2.3 

A = 3 — 1 = 3 

k-2 k-2 

Bk 6 = T-^5.3 - -^3 = 5.3 
k-3 k-3 

k 4 7 

Bk,s = -7.5.3-— -5.3 = 7.5.3 

k — 4 k — 4 

Bk,2k-2 = k{2k-3){2k-5)...3.1-{k-l){2k-l)Bk,2k-i 

= k{2k - 3)(2/t - 5) . . . 3.1 - {k - l){2k -?,).. . 3.1 = {2k - 3)(2fc - 5) . . . 3.1 
(2A;-2)! 
2'=-i(A;- 1)! 

From equation 13.321 : 

limE(Xf) = (2fc - 1)5,, 2fc-2 

t— >oo 

^ ^2'=-i('^- 1)! 2fcfc! 

Ml 
2^A;! 

To find the odd moments of Xt we perform the same procedure as above. We have 



n2 

dXt = -Xt—dt + OtdWt 
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Define Gi{t) = | j^Olds. Multiply by the IF=e'^i*^*) on both sides we have 

Applying the same argumenst as above we can show that 

lim E(e"^^(*)) = 

and so 

lim E{Xt) = 0. 

t—^oo 

Now we apply Mathematical Induction. Let us assume that 

lim = 

t—^oo 

where m = 1, 2, . . . , fc is any positive integer. Applying Ito's lemma to Xf'''^^ we get 
rf(Xf+i) = {2k + l)X^''dXt + ^{2k + l)2kX^''-^9^dt 

= (2k + l)Xf [-Xt^dt + ^trfW^t j + (2A; + \)kelXf-^dt 

= (-^(2/e + l)Xf +1^2 ^ ^2A; + l)A;Xf "^^^jdi + {2k + l)^^^^^!^^ 

Define Gfc(t) = ^^'^ j^O^ds and the integrating factor as e'^''*^*^. Multiplying by it 
on both sides of the above equation we get: 

^Xf+i = e-^^W [x^'^+i + A;(2fc + 1) f e^'''^'^lXf'~^ds + {2k + l) f e^^^'^9,XfdWs 

'-Jo Jo - 

= E(e-^^W)E(X2'=+i) + E(A;(2A; + l)e-^^W^ e^'^^^^^X'" "'^^s) 
+ (2A; + l)E(^e-^'=(*)^ e^^(^)xfMVr,) (3.34) 
We now try to prove that the third expectation is zero. 
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Following the similar notation as before let Fk{t) = e "^fe^*). Also define Zk{t) = 
Fk{t)Yk{t) where Yk{t) = e^^^'^X^^esdWs. Applying Ito's lemma to Zk{t) we have 

dZk{t) = Yk{t)dFk{t) + F,{t)dY,{t) 

= ^Ykit)Fk{t)e^dt + Fkit)Fk{t)X^%dWt 

2k A- 1 

Zk{t)e^dt + x^%dWt 



2 

Now, taking Zk{t) = —Zk{t), yields 



2k + 1 

dZkit) = -^Zk{t)e1dt + xlHWt 



2k + l ~ 



Zk{t)e1dt - xf^dWt 



where Wt = —Wt. Since Zq^^ = — ^o,fc = = Zo^k- Hence Zk(t) and Zk(t) has the 
same distribution, i.e., the distribution is symmetric around 0. Therefore to show 
that E{Zt.k) = 0, Vt > one needs to show Zt^k has finite expectation Vt > 0. 
Thus, as before, it is sufficient to show that E{Zk{tY) < oo, Wt > 0. Now, 

Zkity = Fk{tfYk{tf = Fu{tf(^j\''''^'^XsesdWsy 
^E{Zk{tf) < Ei^j\^''>^^'^Xfelds^ 



Now, as in above 



[j^e'^^^^^elds) = ^^^^(eWmt) _ 1) ^here F(t) = 
< C ^ E(^j\^''^^'^elds^ < oo 



Now, one can write 
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The second and the third term can be shown to be finite (the details are same as in 
13.31 expect that the exponent of Xt is 8k, so it is not repeated here). Therefore 



e(^J^ Xfelds^ < oo ^ E{Zk{tf) < C 
This proves that E{Zk(t)) = 0. Therefore from 13.35) 

hm +^) = hm E(e^^^(*))E(X2'=+i) + hm E(k{2k + l)e^^''W / e^^^'^eX'^^^ds 



Writing 



Bk,2m-i{t) = E(2k{2k + l)/2e^^^^'^ e^'^^'^OX'^-^ds^ =^(^2^6"^'=^^ X^— M(e^''(^ 



we have 



hm E(Xf 

t— >-oo 



hm E(e-^^W) + hm Bk,2k-i{t) 



hm Bk,2k-i{t) 

t^oa 



(3.35) 



Now /Jxf'"-M(e^^('')) 



y-2m-l Gk{t) _ y2m-l 



(2m - l)Xl'^-'dX, 



1 



+ |(2m-l)(2m-2)X2'"-3^2^s 



v2m-l Gk{t) _ -v-2m-l 



nt , q2 

/ e^^W {2m-l)Xl'^-^[-Xs^ds + e, 
Jo \ 2 



-(2m - l)(2m - 2)Xl'^^'^elds 



-(2m-l)(2m-2) [ e^'^^'^Xf"'-^9l)ds - [ {2m - l)e^''^''^ X^'^-^OsdWs 
2 Jo ^ Jo 
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Therefore Bk^2m~i{t) 



(2^ - 1)(2^ - 2)^^^.,^,) / X-3^e-^(^)^?rf. 



2k + 1 Jq ' 2 

2777 _ 1 /"* 

2k -\- 1 Jq 



(2m- l)(2m-2), 
2A; + 1 



2?77 — 1 

= E(2A;X,2'"-^) - i?(Xo^™-^)i5;(e"«'=W) + 
(2m-l)(2m-2) ^ 

Writing 5^^- as the hmit of Bkj(t) we have 

H - n n I ^^"^ P (2m-l)(2m-2) 

2A; + 1 2k + 1 

^ (2A: + 2 - 2m)5fe,2m,_i = -(2m - l)(2m - 2)5^,2^-3- (3.36) 

Now 

Bk,i{t) = E(2A;e-^'=W / XJ{e^^^''>))ds 

Jo 

= i?(2fce-^^« [x^e^^W ~ ~ Jo ^'''^'^'^^^] ) 

= E{2kXt) - 2kE{Xoe-^'^'^) - E(2ke^^'^'^ e^^^'^ -xf^ds + O^dW^ ) 

= E(2A;Xt)-2A;E(Xoe-^*W) + — ^E(2A;e-^*(*) / X,rf(e^'=(^))) 

2/c + 1 V Jq / 

^ Bki = - + — ^ — 5fc 1 
2k + 1 ' 

^Bu,i = 

This and Eqn fl3.36l) would imply that 

Bk,2m-i = for m = 1,2, . . . , /c 



Therefore from 13.35] we have 

hm E(Xf +1) 



t—>oo 



Therefore combining the two results we have E{Xl] 



28 



^ when r = 2k 
when r = 2k + 1 



4 Conclusion 



Verifying Roberts et al's condition (see |9]) for checking the ergocity of an AMCMC 
can sometimes prove difficult. In the companion paper (see [I]), we considered an 
AMCMC with the proposal kernel dependent on the previously generated sample 
and an arbitrary target distribution. There we performed a diffusion approximation 
technique to look at the continuous time version of the discrete chain. In this paper 
we narrowed down to the case where the target distribution is standard Normal. We 
investigate whether the invariant distribution of the diffusion is indeed the target 
distribution. It turns that the resulting diffusion (which although singular) admits 
an invariant distribution. Then identifying the limiting moments of 2k order of Xf 
we identify the limiting distribution to be A^(0, 1). 

The techniques applied here are specific only when the target distribution is Nor- 
mal. We hope that this can also be extended to other target distributions, where a 
identification of the limiting moments is possible. Also more choices of the proposal 
distribution can be made, where the kernel is dependent on a finite (or possibly 
infinite) past. We plan to take up these issues in our future work. 
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